Integrated Multilingual Speech Recognition: Impact on Chinese Spoken Language Processing
Abstract
The notion of integrated multilingualism is introduced as the basis for a novel approach to multilingual speech recognition. This approach enables training of the recognizer using data from only the source language(s). The trained recognizer is nevertheless directly deployable to new target languages. The performance of the recognizer is incrementally improved via a language adaptation strategy. In this paper, an overview is provided of a language-universal, feature-based phonological model and of a dynamic phonetic model that, in combination, constitute the general framework of integrated multilingual speech recognition. Some related speech recognition experiments in connection with this framework are described. Finally, the impact of the new approach on Chinese spoken language processing is analyzed in terms of the need to develop a successful and uniform speech recognition strategy for a wide variety of Chinese dialects, with little or no training data required for the dialects.
Similar resources
A Multilingual Spoken Dialog System
This paper briefly introduces MSDSKIT-1 (Multilingual Spoken Dialogue System Version 1.0, developed by Kyoto Institute of Technology), which currently integrates Japanese and Chinese. It is an upgraded version of SDSKIT-3 (Spoken Dialogue System in Japanese). This system can provide services such as sight-seeing introduction, traffic guidance, and hotel reservation. A user can also plan his itinerar...
Multilingual Spoken Language Corpus Development for Communication Research
Multilingual spoken language corpora are indispensable for research on areas of spoken language communication, such as speech-to-speech translation. The speech and natural language processing essential to multilingual spoken language research requires unified structure and annotation, such as tagging. In this study, we describe an experience with multilingual spoken language corpus development ...
Spoken language processing in a multilingual context
In this paper we overview the spoken language processing activities at LIMSI, which are carried out in a multilingual framework. These activities include speech-to-text conversion, spoken language systems for information retrieval, speaker and language recognition, and speech response. The Spoken Language Processing Group has also been actively involved in corpora development and evaluation. Th...
Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System
This paper outlines recent developments in multilingual medical data and a multilingual speech recognition system for network-based speech-to-speech translation in the medical domain. The overall speech-to-speech translation (S2ST) system was designed to translate spoken utterances from a given source language into a target language in order to facilitate multilingual conversations and reduce ...
Fast bootstrapping of LVCSR systems with multilingual phoneme sets
In this paper we describe an efficient method to bootstrap continuously spoken, large vocabulary speech recognition systems by multilingual phoneme sets. To evaluate this technique we collected the multilingual database GlobalPhone, which currently consists of 9 different languages. A multilingual recognizer (MULTI) based on the four languages German, English, Japanese and Spanish was developed t...